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ABSTRACT 

The paper considers how the technical, aspects of; 
interest aeasureaent can contribute to sex bias and suggests 
guidelines for evaluating interest inventories for sex bias and for 
developing interest inventories which ainiaize sex bias. An overview 
is given of interest aeasureaent techniques: in selecting a pool of 
iteas, developing scales, and noraing the scales. The issues raised 
at each step are surveyed. Interpretive practices are briefly 
discussed. Suggestions are offered for ninimizing sex bias at each of 
the three aajor technical steps in developing a new interest 
inventory, and for its interpretation. Twelve specific guidelines are 
presented in suaaary, including these: The content of interest 
inventory iteas should not iaply that any occupation or activity is 
move appropriate for one sex than for the other; scores on the save 
scales should be available for both aen and voaen; nora groups for 
occupational scales should contain aen and woaen in equal proportions 
or the effects of both groups should be statistically equalized; 
published interpretive aaterials should indicate clearly that sex 
noras for hoaogeneous scales do not iaply anything about the 
predictive validity of the scales, (AJ) 
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The world of work 1* divided into two categories, "woaen'e work" 
and **mn*9 work" and anyone who does not know the difference can consult 
such widely divergent sources ae children's prlwrs (tfonen as Words 
and Inages, 1972) » the local want-ads, or United States DefMrtMnt of 
Labor statistics (1969). Individual differences between mn and wown 
have been cited as the reason for this dichotony (Firtona, 1965) as have 
!H>Utlc8 and econosiica (Bird, 1968; Millett, 1969; Firestone, 1970). 
Actually, there are few jobs which require skills and abllltlos which ari» 
possessed exclusively by one sex or the other. In fact, there are few such 
skills (Lewis, 1968). Thus, it follows that there arc few Instances in 
which it is appropriate for an individual "to Unit... his or her consideration 
of career solely on the basis of gender." 

Historically, the oldest Interest inventory still in use, the Strong 
Vocational Interest Blank, was developed in the 1920*t and 1930*s on the 
assumption that there arc two categories of work and two categories of 
workers. Reporting on disappointing early attenpts to measure wonen'a 
interests, Strong cosnented, "The writer's hunch Is that it will be found 
that nen can be better differentia tad respecting a larger nuii>er of occupations 
than can wonan. The prinnry reaaon for this belief is that the Interest 
technique necessitates a fairly homogeneous criterion group. At the present 
tine far too SMny women enter an occupation as a atop-gap until s«rrlage. 
Conaequently, they take a Job becauae it is convenient, not because they 
intend to continue in it indefinitely. The result is that oast occupatlona 
contain a coiisiderabl(> nunber of women who would not be ther^ if they had 
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ti^lected an occupational career as men do* Any sampling of such occupations 
gives a rather heterogeneous groun of women* Occupational scales based on 
such criterion groups cannot be expected to differentiate very well**' 
(Strong, 1943, p* 129)* 

Strong do v«« loped separate inventories for men and women* His 
assumptions probably reflected tho convictions^ of his time> but they also 
influenced future researchers* The latest Mental Mi^asurement Yearbook 
(BuraSy 1972) showed that the male form of the Strong Vocational Interest 
Blank has been utilized in 1,09^ studies since it was Introduced in 1927, 
while the female form of the Strong Vocational Interest Blank had been 
utilised in 168 studies since it was introduced in 1933* About 13 times 
more research has been directed toward studying men^s interests than 
women^s interests » if we make the questionable assumntion that each 
reported study represents an e(iual unit of research* Strong assumed that 
women*s interests were difficult to nieasure and different from men^s 
interests: be seemed to convince a nisjorlty of researchers and inventory 
developers* The interest Inventories available today rest on m^ny of the 
samp assumptions that he utilized* 

One purpose of this paper is to consider how the technical aspects of 
interest measurement can contribute to sex bias in that interest inventories 
may be used to limit consideration of career on the basis of sex* Another 
purpose is to suggest guidelines for evaluating interest inventories for 
sex bias and for developing interest inventories which minimize sex bias* 

ERIC 



Overview of Interest Measurement Techniques 

There are three major technical steps in developing any Interest 

Inventory and sex bias can enter at any point* Firsts a pool of items 

are selected; second, scales are developed; finally^ the scales are 

nonaed • 
Items 

Most interest inventory items utilize occupational names and/or 

occupational activities as items « Names of school subjects » non-- 

occupational activities^ or personal characteristics are used as items 

to a lesser extent. If the items imply sexual stereotyping in occupations , 

i.e. > "polic eman" or "sales lady" > they promote sexual bias both in the 

responses they elicit and in the image of the occupation they project. 

Strong (1962) suggested guidelines for selecting good items for Interest 

inventories but his assumptions and his measurement technique did not lead 

him to include the elimination of sex bias. 
Scale Construction 

After a set of items have been developed » they are administered to a 

group of people. The responses of this group to the set of items provide 

the basic data from which interest inventory scales are constructed. There 

are basically two kinds of interest inventories and two ways to construct 

them. An understanding of the two types of scales and the way they are 

constructed is necessary to an understanding of the potential for sex bias 

in each type. Scales may be based on either external or internal criteria. 

The composition of the group which is used to provide data for scale building 

and the way that data is treated determines the type of scale which results. 
Scales based on external criteria utilize the itr»m resnonses of 

a group of people already employed in a specific occupation (the criterion 

group) to provide a comparison with either a reference group of peoole 
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eoployed In a Urge number of occupations or with the individual being 
tested. These scales are called occupational scales and named after 
the occupation represented in the criterion group. A reviev of all 
interest inventories listed by Buros in the two most recent editions 
of the Mental Measurements Yearbook (1965, 1972) revealed that only 
five of the 52 contain occupational scales based on external criterion 
groups. The five include the Kuder Preference Record - Occupational 
(Fof D), the Kuder Occupational Interest Survey (Form DD), the 
Minnesota Vocational Interest Inventory » and the Strong Vocational 
Interest Blank for Men (FormT399), and the Strong Vocational Interest 
Blank for Women (Form TW398). 

Since these inventories are widely used it is worthwhile to 
examine their scale building methods closely. The occupational scales 
of the Kuder (Form D), the Minnesota Vocational Interest Inventory and 
both forms of the Strong Vocational Interest Blank are all developed in 
• similar manner. For each item, the percentage of an occupational 
criterion group Who make each response are compared with the percentage 
of a reference group who make each response. For example, on the Strong 
Vocational Interest Blank item, "Biologist", individuals may respond 
•like", "Indifferent", or "Dislike". To determine whether that item 
should appear on a scale for female veterinarians, a coiaparison is 
made between the percentage of the occupational criterion group (female 
veterinarians) and the percentage of the reference group (Women- in- General) 
who make each of the three responses. Figure 1 illustrates how weights 
•re assigned to each response for the female Veterinarian scale. Only 
items which differentiate between the two groups by a large percentage, 
usually 15 percent or more, are included. 

The occupational scales of the Kuder Occupational Interest ^rvey 
(Form DO) also utilize criterion groups of people employed in an 



occupation. However, the responses of the individual taking the 
Inventory are compared to those of the criterion group and expressed 
«s a type of correlation coefficient between the criterion group and 
the Individual (Zytowski. 1973). No reference group is used. 

Moat interest scales based on external criterion utilise occupational 
groups of one sex or the other but not both. All interest scales which 
have been developed using external criteria rest on the assumption 
that sex is very important to building the scales. 

Scales based on internal criteria, aUo called homogeneous scales, 
are developed by using some inethod of clustering the Interest inventory 
items. The clustering method may reflect the subjective judgment of 
the test developer, the dimensions of a theoretical model, or the results 
of a factor analysis. The important difference from occupational 
scales Is that the clustering process does not make use of an external 
criterion such as membership in an occupational group m assigning 
items to scales. The group of people tested to provide basic data 
about the inventory items are not specified by occupation although they 
may be specified by age or sex. Their responses are used to provide 
information on how the items cluster. Items which are highly correlated 
for the group tested are placed together in scales. The resulting scales 
are usually more limited in number than occupational scales based on 
an external criterion and they have names which are more general 
CJMechanlcal", "Clerical", and "Social", as opposed to "Engineer." 
"Secretary", and ";; .cial Worker".) 

Another important difference between the two tynes of scales is 
that most homogeneous scales based on internal criteria are ..t sex specific 
are the occupational scales. These internally referenced scales which 
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are designed to reflect general dimensions of interest present In a 
general population, usually nake no assuotptlons about sex and bias Is 
not Introduced in the scale building process. Bias night be Introduced, 
however. If the group used to develop Item statistics for the clustering 
procedure were atypical m some way. I.e., contained only one sex or 
the other. Generally, bias Is not introduced in this way and more 
potential for bias in inventories developed using Internal criteria 

exists in scale nornlng than in scale building. 

K orming Scales 

Occupational scales based on external criterion groups of people 
employed in the occupation usually utlliae the occupational criterion 
group as a norm or comparison group. The score of an individual reflects 
some degree of similarity or dissimilarity to men or women in the occu- 
pation when an occupational scale is employed. Thus, a dagree of con- 
current validity is built into the externally referenced scales. The as- 
sumption that occupational scales require separate criterion groups by sex 
leads to acales which require separate normlng by sex if the scales are 

to make any interpretive sense. 

The norm group for internally based scales is usually the same 

type of general population group utilized in scale building. The age 
range of the norm group is the same as that of potential test Ukers, 
The scales hav« implied or demonstrated content and construct validity 
but there is neither concurrent nor predlctivtj validity associated with 
the comparison or norm group, The homogeneous acales of inventories of 
this kind present a sort of occupational typology (which may or may not 
be carefully documented In content and construct statistics). They 
show Individuals their type(s) compared with others who are similar in 
age and/or sex. It has seemed reasonable to present separate norm 
groups by sex on this kind of interest Inventory because the distributions 



of scores for nen and wonen are usually different. However » it also 

Is reasonable to ask whether the use of separate norn groups by sex 

introduces sex bias* 
Issues Raised 

This review of the techniques of Interest measurement raises many issues 
about potential sex bias in interest measurement. First* it is Important to 
to establish whether there are sex differences in response to items because 
item responses are the building blocks of interest measurement. If there are 
no sex differences at the item level » none of the interest measures which are 
built upon those items need to take account of sex. If there are aex differ- 
ences at the item level it is important to det^ mine whether they are related 
to either item content or Item format » and to determine whether separate 
pools of items are needed for men and for women* 

Another related iaaue ia whether occupational scales need to be 
developed for each aex separately. Why are snany homogeneous scales 
developed without reference to the sex of the respondents? 

It follows that we need to know whether either occupational or 
general norm groups snould be separated by sex. 

The goals of interest measurement ar^^ to help Individuals explore 
their interests in comparison with others and to promote good life 
plsnning. If any of the ptBc-^ices which seam to imply sex bias in 
interest measurement are not actually necessary to these goals » then 
aex bias does exist in interest inventories. 

Issues Related to Items 
Sex Difference in Item Response 

Campbell (in press) > has been able to use the large bank of Interest 
data at the Center for Interest Measurement Research to show that there are 
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large aex differences in reaponse to so.e ite««. auch a. "on.r... 

** """^^ operating machinery," 
-ec„««8 a roc» „Uh no„,„,. and "„pau electrical „l.i„,,. xhese 
-Uference. app..„a .ec„ae„ ^„ an. f .He ea.1, adolescent ^ara 

.nd between g„.pa of and wonen in the occupation, in „30 and 

1« 1968. sex di«e«ncea in it.„ „,non.e .taMe ove. age .„a 

tl«e and the, do not dlsapp.ar when occupation Is held constant 
.oh.nn.on and H.r»o„ 

to the «,e and fe«le ,,„„^ ^^^^^^^ ^^^^^^^^ ^^^^^ 

With occupation held conatant and .ound that an average of « „„„„ 
Of the ite^ Showed Urge .ex differences, within occupations, the 

percentage of items showing laree sex diff^.o 

*ng large sex differences varied from 30 to 58 

The percentage of it.» exhibiting large sex difference, when the 
m-Oeneral and «o»en.in-Genera, groups were co^red wa. THes. 
-.ta were ha.ed on recently tested occupational criterion grouns and 
they indicate a .ub.tantial a«,unt of difference in the way and 
wo»en re.pond to typical Interest inventory iteos. 
Separat e Item PqaIa 

sine, there are doc™.ented sex difference, m responae to interest inven- 
tory It.™, the practice of providing aeparate .eta of ite™. for each sex 
.hould he e^ined. A recent review of Burn. ,,,,,,,_n,,,,,r^ 
^or 1«3 and 1,72 ahowed that eight Inventorle. or sixteen percent of all t^e 
inventorle. Hated In those years have aeparate for« for „lea and females.' 

Not all of them were inspected but it seems dear that i 

»ceias Clear that the only reason to provide 

separate forma is that the item pool differs for . ^ 

fsfwj. uAiiers tor men and women. Thtj 

underlying a..u.^tlon .ee=s to be that ^n and „o.en play different' 
occupational role, which cannot be described by one set of Interest 



laventory ItectB. 

Where separate Item pools exist critics have cited the presence 
of Items descriptive of high level technical and business activities 
In the Iteo pools for sen and the presence of items descriptive of 
dooestic and clerical activities in the Itec pools for women* The 
Strong Vocational Interest Plank (Form T399 and Form TW398) have over 
half their items in conmon yet one of the issues that led to the 
development of the Strong Campbell Interest Inventory, to be published 
in 1974, w&s the separate item pcols. In a memo to the Strong 
Vocational Interest Blank Advisory Board regarding the need for an 
immediate revision of the Strong Blanks, combining them into one form, 
Campbell wrote in January of 1972, "The SVIB does tend to oerpptuate 
stereotypic roles for c^n or women, at the expense of voaen^ both by 
the kinds of items included, and the kind of Information provided in 
the profile," 

Since many reputable Interest inventories have been able to 
develop effective scales without using separate item pools, it can be 
concluded that separate item pools are a source of sex bias in interest 
measurement in that they call attention to sex differences unnecessarily. 

One of the Important side effects of the separate item pools on a 
well researched Inventory like the SVIB is to preclude easy comparisons 
between the item responses of men and women. Even though half the 
items are identical, they are numbered differently in the two forme 
and appear on different scales, creating ne&*ly prohibitive difficulties 
for anyone wanting to compare male and female responses. This author 
believes that systems are designed to accommijdate projected needs. 
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Th« fact that Mle-eenale comparlaona acroftt the Strong BUnki are lo 
difficult auggtatt that no one In over 4O yeara conaidered them 
iB^rtant until now, 

Xdantical Itena poola for men and woiaen a re eaaentlal to avoid 
introducing aaxual biaa and to facilitate research on aex dlfferencea 
in ita» ffttponaaa, where tex dlfferencea in itec reaponae are found, 
it la naeaatary to explore their real lotpact on Intereat Inventory 
scalaa before one drava conclualona about the effect of aex dlfferencea 
in oaaauring intereata. 
Item Content 

Intaraat itam content can imply tha "correct" sexual identity of the re- 
spondent who andoraas it. Few man would dare say they like the occupation 
"•alaaXady." Certainly itama which include tha words "wan," "woman," "girl," 
"boy," or "lady," are aaxually biaaad. Other itama which do not explicit- 
ly involve a e«xual ataraotype naverthalaaa may elicit ateraotypad images and 
raaponats bacauaa of cultural bias. For example, tha item "nurae" is prob- 
ably parealvad aa k feminine occupation and raapondad to aa a feminine occu- 
pation although tha name of tha occupation doaa not imply an "appropriate" aex. 

There haa bean aone controversy over the beat type of item content 
tot mtareit invsntorlss. Strong (1943) built his Invsntotiea with 

favsYal types of content such aa occupational nan«s, activities, school 

J* 

aubjtecs end paruonal characterlstlca. Holland (1965^ devslopsd sn 
invetitery, lha Vocational •"reference Inventory, with items which are 
all occupational titles. Be argued (1973) that occupational nanes 
ftlsulate raUabld vocational stereotypes in the tost taker. Caopball, 
(in praif) found that occupational tltlea aa itens separated occupational 
greupi aore atfectlvely than Itema on tha Strong Vocational Interest 
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BUnk which utilized other types of content. Kuder (4 » 70) arguee 
pertuaslvely against the use of occupational titles. 

Now we need to deteroine if sexual stereotyping of occupations 
is more closely related to one type of content than another. For 
instance, if an occupational name like "Physician" is perceived as 
Msculine. while an activity like "watch open heart surgery" Is not, 
using items based on occupational activities way be a subtle way of 
coobating the culturally induced sexual stere typing which each 
individual brings to the testing situation. 
Item Format 

There has been considerable controversy about item format or the way 
the item is presented (Kuder, Dolliver, 1968; Campbell, in press). 

The most popular formats include presenting one item and asking the 
respondent to endorse it or not and presenting, two or three items and 
asking the respondent to choose betwaen them or rank them. Both for- 
mats have been accused of reducing validity. If men and women respond 
to item formats differently, item format can contribute to bias. 
Sfong(1943) presented evidence of minimal differences in response bias 
bet%reen men and women, but it is not clear whefner they responded to the 
same items. The issue is complex and of leaser importance than some of 
the others, but it might be investigated. 

Is-^ues Related to Scale Df^velopment 
The task in developing scales for an interest inventory is to 
provide scores which will reflect the interests of an individual in a 
way that is useful in career planning. Obviously scales which contain 
sexual bias cannot adequately fulfill this function. Given the fact 
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that scales are built from Items and Items do differentiate between 
the saxes » it la Important to explore the Impact of sex differences 
on the scales » 

Scales Based on External Criteria 

For Interest inventories utilizing external criteria in scale de- 
velopment» the crucial issue is what the criteria should be* Usually^ 
the criteria is membership in some occupational group — by sex. There 
are two subissues» however. First » there is the question of whether it is 
useful to employ separate criterion groups of each sex where members 
of both sexes are readily available. Second, there is the question of what 
to do when members of only one sex are available for a criterion group. 

Se parate sex scales . The belief that men and women in the same occupa- 
tion do different things has been expressed by Johnson (1970). Intuitively 
his assessment of the situation seems correct; male social workers become 
administrators and teachers* while female social workers see clients; male 
lawyers practice corporate law while: female lawyers practice social law. 
We do not ]u v whether these arrangexoents of people within occupations 
are political or whiithkv they are based on real sex differences in 
interests. 

Both the Strong Vocational Xnterefic Blank and the Kuder Occupational 
Interest Survey scales are based on separate criterion groups of males 
and females (even if it requires two scales for one occupation). The 
Implication is that sex differences are very important in occupations 
and In measuring Interests. In the study (Johansson and Harmon, 1972) 
where large numbers of items were found to differentiate between men 
and women in the same occupation^ a further analysis wss performed. 



13 

The investigators found that a relatively small proportion of the 
Iteaa which differentiated between the sexes In an occupation actually 
appeared on the male or female Strong Vocational Interest Blank scale 
for that occupation^ Thus, the effect of -.ex differences in item 
response is not as great as It appears from the number of such items 
because they do not all appear on occupational scales. Johansson and 
Harmon concluded that it might be possible to build good scales by 
combining males and females in an occupation and comparing their item 
responses with those of a combined in-general or reference group. 
Since the Kuder Occupational Interest Survey uses all the items on 
each scale (Zytowski, 1973), male-female item differences might have 
a greater effect on scales if criterion groups of people in the same 
occupation were combined « 

Combined sex scales > Some preliminary work on building occupational 
scales baaed on both sexes is being done by the author and Pat Weber of the 
Center for Interest Measurement Research. The new Strong Campbell Interest 
Inventory has a common item pool so the effect of male- female item 
differences on scales can be studied dlr^tly. Three tentative scales 
were developed; a scale utilizing the comparison between male 
veterinarians and a male reference groups a scale utilizing comparisons 
between female veterinarians and a female reference group, and a 
scale utilising the comparison between a criterion group of male and 
female veterinarians weighted equally and a reference group of males 
and females weighted equally. Table 1 lists the items from the 
Occupations section (Items 1*131) which appeared on each of the three 
tentative scales* Eight items appeared on all three scales , twenty 
items appeared on two scales (all twenty on the scale for combined sexes 
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and ten each on the maie and female scales), five Items appeared on 
the male scale only, seven appeared on the feaale scale only, and one 
appeared on the combined scale only. These findings demonstrate that 

for veterinarians it is possible to develop an occupational scale for 
combined sexes which has an adequate number of items and which will 
correlate with both the single sex scales. Appropriate vallditjr 
studies have yet to be computed. Whether or not other occupational 
•calea can be derived in the samft way is not clear. Strong (1943) 
studied this problem by scoring men and women on scales for both 
sexea. He concluded that the man's and wonen's scales could be 
combined in some cases but not in others. Interestingly enough, he 
did not try it. More research is still needed. If valid scales can 
be built using combined criterion and reference grouos, it will not mean 
that the Interests of males and females in an occupation are the same, 
but that sex is not an Important enough part of the criterion to 
require a separate scale. 

Weighting aexea in criterion groups. Another kind of problem arises in 
building occupational scales where there are only a few members of one sex 
employed in the occupation and/or available to be in the criterion group. 
The procedure used in scale building should minimize the effect of male- 
female differences in item response unless it can be demonstrated that they 
are Importantin building an effective scale. Although the idea of composing 
each criterion group of the same proportion of males and females as are 
found in the occupation makes some sense at first, further reflection 
shows that it is only a modification of the approach already in use. 
If 90 percent of an occupation is composed of males and that proportion 
is maintained in the criterion group we would expect the scale to be 
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more representative of men's interest than of women's interests* 

Actually, it is not even that simple where the Strong Vocational 
Interest Blank is involved. Since the method employed literally 
removes the interest common to one sex or the other from consideration 
(DOlUver. 1968), atypical interests for males may be typical interests 
for females and result in findings like those of Stanfiel (1970) who 
showed women obtained more high scores on the men's form than the 
women's form of the Strong Vocational Interest Blank. Camobell (in 
press) presented data which showed that males score higher on many 
scales developed for females than on the scales developed for malos in 
the same occupation whilu females scored higher on many scales devoloned 
for males than on the scales developed for females in the san^ occupation. 
It might be possible to apply a sort of correction factor if the 
differences were consistent across all occuoations. Unfortunately, 
there is considerable variation in the ainount of difference between mrn 
and women in scores on male and female occupational scales. 

We can conclude that the best way to minimize sex bias is to weight 
males and females equally in criterion groups and reference groups even 
if that is not the proportion which reflects the status of the occupation. 
Schlossburg and Goodman (1972) have indicated that there are enough 
people of the "opposite" sex in many occupations to provide adequate 
criterion groups of males and females in atypical occupations. Criterion 
groups are usually selected by finding some concentrated source of 
people committed to their occupation, such as a professional organization, 
union, or licensing board. Scarcity of members of one sex in an 
occupation dictates a more extensive and more expenf^ive search for 
criterion group members. While the number of individuals needed in a 
criterion group varies in a way that is not predictable beforehand. 
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200 individuala li probably « reaioiuible minimum (Harmon, 1968). To 
provide a good criterion group with both aexea reapreented equally It 
la probably neceaaary to have a nlnlnim of 200 Indlvlduala of each aex. 
If thla minimum la met, the Item reaponae percentage for each sex 
ahould be averaged. 

One aex aealaa * Thla proc«dura will not worlj where leas than 200 
paraona of each aex are avo^ilable, ao the queatlon of whether to build c alea 

a alfigle aex (and how to uaa the many which have already been built) 
arlaea. It aeema unreaaonable to build more acalea based on one sex 
when they may contain aex blaa* After equal opportunity has been on 
operation for a few decades we may be able to determine If there are 
occupation«t which only attract members of one aex. Until then, our 
efforta can be expended In building occupational acalea baaed on 
criterion groupa of both sexea. 

Where acalea have already been developed ualng criterion groups 
of one aex, we ahould probably continue to use them for the appropriate 
aax until they can be converted to acalea for both aexea. There has 
baen conalderable controveray about whether to uae acalea developed 
ona one aex with the other* Darley and Hagenah (1955, p. 71) suggested 
ualng the men 'a form of the Strong Vocational Interest Blank in 
conjunction with the wotoen'a form for women who have a "high degree of 
career antlvatlon, maturity, and ability..." Strong, (1943, p. 576) 
waa convinced that "it la much better to acore a sex on its own scales." 
Tha Aaaoclation for Meaauremant and Evaluation in Guidance Cosamlssion 
on Sax Biaa in Meaauremant (1973) took the poaitlon that using "wrong" 
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sex scales Is psychooictrically meaningless. The data from Stanflel 
(1970) and Campbell (in press) suggest they are right, 

Kud«r (1970, b) showed that there are respectable correlations 
(medians from ,7^-. 81) between women's scores on thirteen sets of Occupational 
Interest Survey scales for men and women in the same occupation. Although 
the level of their scores on the male and female scales might "differ 
considerably", "scatea that yield the highest scores in one set generally 
yield high srores In the other*" Thus, it appears that one can use "wrong" 
•ex scales ott th« Kudet Occupational Interest Survey if one f^xplores thp 
highest icores on those *'wrong" sex scales comparing their content but not 
their level with that of the highest scores on the correct sex scales. 
Thla nrocedure cannot be generalized to inventories with scales which were 
not developed «• the Kud^r Occupational Interest ^rvey scales were developed 
Without additional study. It would seem more fruitful to exert thm same 
effort in developing new scales wherr scales based on one spx «re rh*» only 
•cales available. 

Note that th»»te .is seldom a problem over testing men on occupational 
scales developed on criterion groups of women. The scales which are available 
for wotien but not for men do not represent the most prestigious or highly 
paid occupations In our society, which may explain why B»n do not need to 
«*se thew. Alternatively, the cultural prohibitions against nten being 
Interested in "woaen's" occupations may be much stronger than cultural 
prohibitions against women being interested in "n^n's" occupations. 

Criterion group st ability over time . Since collecting criterion groups 
la expensive, It la reasonable to ask whether, once collected, scales based 
on them can be appropriately used over long periods of time. Campbell (1971, 
Chapter 9) presented four different kinds of evidence that the people in 
various occupations do not differ much over 30 years in their measured interests. 
Only one of the studies involved women subjects. How<:ver, the evidence strong- 
ly suggested that people in occupations have a common set of interests 
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whtch do „« Change ^ch o«r tl,^. Ih,. ti„ai„g p^^ably apnlu-, equally 
to men and women • 

General change, 1„ the popularity of Strong Vocational Interest 
Blank Ite™ over tt» have been docu«„ted for uo^n by Hill and C,»pbeU 

and for »e„ by Campbell (1968). since Interests of those In 
occupation, are «re stable, these findings apply «,re directly to the 
composition of reference or In-general groups. If reference group, are 
to be used the problem of cultural change In item response must be 
investigated further. Unfortunately, studies of differential rates and 
types of cultural change between men and wos^n have not been done as 
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t'ey require longitudinal date and a cosnaon item pool for men and woiaen« 
Apparently, researchers on the Kuder inventories are in a position to 
provide this inforxaatlon earlier than any other researchers* 

The foregoing discussion leaves unresolved the question of what 
to do If there la only an occupational scale for the 'Wong" sex or 
no scale at all available where one is needed in counseling* The 
availability of occupational scales can be a source of sexual biaa» in 
that one can be discouraged frofn considering a career or a whole set 
of careers by the absence of an appropriate interest inventory scale* 
The only solution to this problem in both the long and short range 
comes from the use of homogeneous ^^cales based on internal criteria^ 
since occupational scale building will always lag behind changes which 
occur in the world of work« 
Scales Based on Internal Criteria 

Homogeneous scales appear on the Strong Vocational Interest Blank 
and the Mimiesoi:^ Vocational Interest Inventory, which also have scales 
based on external criteria, as well as on dozens of interest Inventories which 
have homogeneous scales exclusively^ The set of homogeneous scales available 
on a given Inventory suggest a model of the organization- o' vocational 
interests* 

Many researchers have addressed the question of the factorial structure 
of Interests • Two of the earliest studies, by Thurstone (1931) and Crlsaey 
and Daniel (1939), found somewhat different factors for men and women* 
Both studies found ••science", "language^', and ••people** factors but Thurstone 
found a •'business'' factor for men while Crissey and Daniel found an "interest 
in male association'* factor for women. Scales which contributed to the lat- 
ter Included: housewife, office worker, stenographer, and nurse* 
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Hore recently, factor studies have concentrated prloiarily on the 
structure of men's interests as reviewed by Super and Critis (1962). 
So have attempts to understand the conflicting information which sometimes 
arises from the use of two inventories (King, Norrill and Powers, 1963; 
Zytowski, 1968, 1969, 1972; Kuder, 1969; 0*Shea and Harrington. 1971, 
0*Shea Lynch and Harrington, 1972). 

The most promising data on the structure of interests is that of 
Holland (Holland, Whitney, Califorroia and Richards, 1969.) and Cole 
(1973; Cole and Hanson, 1971; Cole, Whitney and Holland, 1971; Cole and 
Cole, 1970) in which a structure is proposed, a methodology established, 
and applications of the structure to occupations, to men's interests 
and to women's interests are substantiated. The results show that 
there is an interest structure coiamon to men and women which corresponds 
to the circular two-dimensional arrangement of interests proposed by 
Rol (1956) and Holland, et al (1969). Both inventories based on 
internal criteria and inventories based on external criteria were included 
in Cole's analyses. 

Cole (1973) suggested an extremely valuable method of surcounting 
the problems of sex bias Inherent in using sets of externally referencpd 
scales which have no scales for occupations women do not traditionally 
enter. It is to use the available scales from these inventories to 
locate the Individual's interests in the circular structure and to 
generalize from that position to the full range of occupations which 
occupy nearby positions in the structure. Homogeneous scales on the 
Vocational Reference Inventory (Holland, Hf5), ACT's Vocational 
interest Profile (1972), and the new Theme scales on the Strong Campbell 
Interest Inventory (Campbell, In press) correspond directly to the 
structure <.nd can be used to locate individuals' Interests in the structure. 



20 



The process does rest on an important aseumptlon which *ould 
be investigated. Since the occupational locations in the structure 
are based on the vocational preferences of young people, not employed 
adults, the method does not possess any dejionstrated predictive validity. 
The work of Campbell and Holland (1972) and Hansen and Johansson (1972) 
established concurrent validity for scales related to Holland's model 
(1966, 1973) which correspondt to Cole's structure, by showing that they 
separate occupational groups. Like most homogeneous scales, they do 
not differentiate as well between occupational criterion groups as 
occupational scales. However, there is enough data regarding the 
construct validity of Holland's formulation (Holland, 1973, Chapter 5) 
to suggest that more evidence of predictive validity will be forthcoming, 
^t the present time. Cole's method for using interest inventories 
appears to be the best way to procede, but predictive validity must be 
established. 

In general, hoiaogeneous scales have not been shown to have nuch 
predictive validity. Zytowski (in press) reviewed studies which have 
attempted to establish predictive validity for homogeneous scales. He 
conducted a study of the Kuder Preference Record utilizing two methods. 
His conclusions were that the Kuder (Form ^ was predictive of future 
employment but not accurate enough to use in individual counseling. 
His subjects were men but his study illustrates the point that the 
type of instrument which has least potential for sex bias because of 
conaon item pools and common scales for both sexes has the least 
predictive validity (Harmon, 1973). 
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Issues Kelatod to Norma 
Norms for interest scales with external criteria are usually 
Implied by that criteria. If the criterion group is made up of persons 
of one sex so is the norm group. The same issues about th« appropriate- 
ness of this procedure arise in discussing both norms and scale 
construction. The inappropriatenesa of scoring individuals on scales 
developed on the "wrong" sex was discussed earlier. The only way to 
insure that the norms on interest inventories based on nxternal 
criteria are not sexually biased is to make sure that the scales are 
not 9€xually biased. 

Norma for homogeneous scales based on internal criteria are 
usually developed by scoring a group of people within some age range on 
the scales. Most homogeneous scales such as those on the Vocational 
Preference Inventory, the Kuder (Form C), the Ohio Vocational Interest 
Survey, and the Strong Vocational Interest Blanks have separate norms for 
men and women. While norms do not confer validity, they are used to compare 
individuals with a group which will provide a meaningful comparison. In 
the case of interest inventories these comparisons are used as a basis for 
dlsrussion about future plans, D'Costa (1972) and Goldman (1972 (a)) hav*» 
advocated that interest inventory results be used as aids in discussion 
and vocational exploration. However, as soon as they are used predictively , 
the norms become very important. Me know that it is psychometrically 
defensiveable to provide separate norms when the scores for two grouos 
(for instance, male and female) differ appreciably. However, the practical 
meaning of S'»x- segregated norms is obscure. For Instance, if a high school 
senior woman scores high on a homogeneous scale for Machine Work compared 
with a high school 



22 



senior wotnen how should she Internrot this inforinationl One might 
guoss that she would score lower on norms for high school senior in**n 
but on the Ohio Vocational Interest Survf*y rr>port form (whore a 
Machine Work scale appears) the ouile norms are not available to her* 
Should she be encouraged to consider nachine work because she scores 
high compared to her sanv? sex neers, encouraged not to consider machine 
work because she probably scores lower than her opposite sex peers or 
encouraged to consider her interest in machine work compared to people 
who do and like machine work. In the latter case„ the norms offer her 
no information at all* 

The problem is that general sex norms for homogeneous scales confer 
no criterion related validity but we tend to use them as though they do* 
It would probably be better to report scores on hoisogeneous scales simply 
as a code type> as in Holland *s Self Directed Search (1965, b), encouraging 
individuals to explore their strongest interests and avoid comparisons with 
other people comol^tely* If results of homogeneous scales are to be used 
predictivoly the scores should be validated^ Concurrent validity could 

established by establishing norm groups of employed workers in various 
occupations as with externally developed scales* However ^ the question 
of how the sexes should be represented in such norm groups is unresolved • 
They could be represented equally^ proDortlonately ^ or separately , and only 
further study will show how it should be done* It is certain to be an 
expensive and difficult task* 

Although cole (1973) developed her structure of women's interests 
utilising scales which wore developed and/or norc^ed for women only» the 
author feels strongly that it would be more fruitful to explore the predictive 
validity of her suggested procedure than to collect more occupational norm 
groups for hotnog^^neous scales* 

o 
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This type of research should be possible both at American College Testing 
Program and Center for Interest Measurement Research* Both have a large 
mature bank of data on Individuals ^ in the sense that the data are old 
enough to support meaningful longitudinal studies* The eventual occupations 
of students who took the Vocational Interest Profile at ACT can be ascertained • 
At the Center for Interest Measureinent Research the task is even easier, 
since older Strong Vocational Interest Blanks for ncople whose eventual 
occupations are knovn can be scored with the scales based on Holland's 
theory (Campbell and Holland, 1972; Hansen and Johansson, 1972). 

If the use of hosaogenpous scalt^s normed for woinon and the occupational 
structure presented by Cole are predictively valid they should be used. 
The use of code types to locate an individual in the occupational structure 
without further reference to norms should also be <>xplorod. If these procedures 
do not prove to be predictively valid » we will have to norm homogeneous 
scales on employed groups in an attempt to estebltsh concurrent validity. 

While a did(.ussion of validity seems out of place in a discussion 
of norms » it is clear that norms on interest inventories have no purpose 
at all if they ar*^ unrelated to validity* 

The discussion of norms brings up the question of how age is relate 
to criterion groups and norm groups. Campboll (197l> Chapter 5) has 
demonstrated that there are a number of Strong Vocational Interest Blank 
items which differentiate between men of vfirious ages. He used their to 
develop an Agp Related scale. He was also able to identify scales where 
scores IncreaaeU decreased with age in a large fairly representative 
sample of men. Apparently, even though individual Strong Vocational 
Interest Blank scores are quite stable over time (Campbell, 1966, 1969) 
there are group differences in item response and scale scores over time. 
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Since vomen, at least at this tisie^ probably utilize vocational 
counseling on a different schedule than men. the (Effects of these differeu 
nay be Important. Aatln and ifyint (1971) and Harmon (1970) have shown 
that career women are difficult to distinguish from non-caroer women 
until the woman Is an adult, after the time when the information is needed 
for career planning. Many women do seek carepr counseling as adults with 
grown children, rather than as late adolescents. 
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It i« appropriate to aak whether Interest Inventories provide 
adequate meaaures of interest for "returnit*^" women. The question of 
appropriate norm groups and occupational groups for older woioen is 
crucial. 

Given the data presented by Campbell it would appear to be best 
to have norm groups or criterion groups composed of a wide range of 
ages. This Is usually the case in inventories based on external 
criteria since occupational status is aaore important than age. On 
inventories based on internal criteria* age as well as sex norms are 
often used. Few of them have adequate adult norms, high school 
and college norms being nuch more typical. Homogeneous scales can be 
used with returning women to establish an individualized order of 
preference among vocational dimensions without reference to norms. 
This preference may be more useful than comparisons which are potentially 
effected by age differences. These issues should be considered in 
developing interest scales to be used with women because of the 
different life patterns they are likely to follow. 

Interpretive Practices 
These are obviously complex issues which it is important for the 
counselor to understand. Some of the problems can be alleviated by 
choosing the most appropriate inventory for each individual. If the 
client is a 14-year-old girl the problem is quite different from that 
of a 50-year-old woman, and it would seldom be appropriate to select 
the same interest inventory for both. In general, it is important to 
select an inventory with an unbiased item pool. The counselor should 
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not reinforce sexual stereotyping by presenting the client with test 
materials which appear to divide the world of work Into two classes^-caale 
and feosale^ The report of scores or interest profile should not contribute 
to sexual stereotyping either^ but report forms for most of the major 
Inventories are currently unable to meet this criterion* They either 
contain separate scales for each seK» or separate norms for each sex» 
Since we are not sure at this time ^^hethar separate norms and separate 
scales are necessary » since there almost certainly are cocs'>lex influences 
at work to make the use of '"wrong" sex scales and norms questionable > 
how to present Interest inventory scores is a problem* It seems silly 
for psychol( ^ists to have to admit to clients, that our professional 
eyesight has been afflicted by a form of double imagery, (seeing the 
world of work as a sexual dichotomy) which has resulted In a set of 
psychological instruments which we now realize (being on our way to 
better eyesight) are largely unlnterpretable* The beat proposal before 
us seems to be to use the Instruments available to assign women clients 
to a position In Cole^s (1973) structural model (*:aich approximates 
Hoi land » Whitney ^ Cole and Richards* theoretical hexagon). This procedure 
will require that the practicing counselor keep a close eye on the 
research literature of vocational Interest measurement. If some 
evidence of the predictive validity of this approach does not appear 
within the next five years, it should be abandoned* 
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Two Psychometric Fantasies 

Suppose that the origins of interest roeasureiaent lead back to a 

group of men and women who were not operating under the assumption that 

some jobs are men's jobs and other jobs are women's jobs. They would 

have devised a pool of items to differentiate between people in various 

occupations and people in a general reference group or to differentiate 

people who had high interest in one interest factor or cluster from 

people who had high interest in another. Certainly, this idea could 

have occurred in the 1920* s when the early feminists had scored a oajor 

victory and earned the vote. If sex had been an iniportant variable 

In measuring Interests in various occupations or interest factors, the 

researchers would have found out early. If they started with the 

assumption that all work is available to all people, they would not 

have relegated the problem of how to treat real sex differences to 

unimportance. Instead, they would have worked to insure that sex 

differences entered the measurement process only where they were 

really appropriate and had something to contribute toward helping 

Individual* learn about their interests and how those interests relate 

to Jobs. But this fantasy is too idealistic, one cannot change the past. 

One can influence the future, so it may be more productive to 

fantislie in that direction. If I were an unlimited grant today 

to build a new interest inventory which would minimize sex bias, how 

would I precede? 
Itema 

Obvloualy, I would avoid items which imply an "appropriate" 
aax. I would attempt to determine whether items containing occupation- 
al names or occupational activltiea elicit more responses based on sexuaJ 



27 



stereotypes. Data is being analysed now which taay help answer this 
question. Conroe (1974) asked 50 male and 50 feiaale college students 
to respond to the Strong Campbell Interest Inventory in a "typically 
masculine" and "typically feminine" manner. Since the Strong Campbell 
Interest Inventory contains both types of item content, it will be 
possible to study the responses for each condition to compare which 
class (names or activities) elicits the largest differences between 
them. Actually, the data will allow for comparisons between all the 
types of content on the Strong Campbell Interest Inventory, 

Since I know that some items are more highly endorsed by men than 
by women, I would want to balance the number of each type in my 
Inventory. (Association for Measurement and Evaluation in Guidance 
Connlssion, 1973). This would avoid the appearance of bias toward 
one sex or the other and contribute to face validity for both sexes. 
It would also insure that a preponderance of items more highly endorsed 
by one sex or the other on any scales could not be attributed to the 
base rates or number of such items available in the item pool. 
Scales 

Since there is no clear answer as to whether scales based on external 
or Internal criteria are best» I would build both types. First > I would 
administer my items to a large randomly selected group of adults and identify 
interest factors or clusters* I would expect that factors similar to the 
types proposed by Holland (1966) would appear. It is clear from Holland* s 
review (1973) that many researchers are finding that both interest and occu- 
patlonal'data fit his model. 

Then I would build some occupational scales. I would attempt to 
select occupations for study which; 1) are least likely to Imply sexual 
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seterotyping when their names appear on a profile or report forw, 

2) are representative of major areas of interest, and 3) are representative 

of major levels of occupations. Dian»nd (1971) has shown that sex 

differences are more impor tant at low occupational levels than at higher 

occupational levels, 1 would avoid occupational groups with extreme 

splits in sexual composition. Xf a group had less than 207. of one sex 

or the other I would avoid building a scale for it. My rationale is 

both psychometric and political. Building scales tor one sex occupations 

introduces problems of sex bias in scales and norms. A 

scale for "able-bodied seamen," based on a sample of 200 men in the 

occupation, is more likely to include or weight heavily a number of 

items which differentiate men from women than such a scale based on 

200 women and 200 men. This problem might be solved Ir one had AOO 

men and 100 women by weighting the responses of each sex equally, but 

why build a scale for an occupation which is* essentially a "male" 

occupation? The answer is to encourage more women to enter it. Analogously 

we should build a -ales for cccupations like child care worker to 

encourage more men to enter that occupation. This argument illustrates 

how the psychometric and political aspects of the problem interact in 

decision making. This phenomenon has, no doubt, been at work in 

interest measurement technology for years without anyone taking 

particular notice of it. 

A general reference group for use in selecting items might be 
derived from the general population group used in developing homogeneous 
scales. Both Strong (1943) and Campbell (1971) have described problems 
in formulating a general reference group. Occupational level and 
culture as related to year of testing have been shown to effect the 
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reference group* Thus, the beet reference group for aelectlng or 
weighting it^ms on the occupational scales would becoiae a subject for 
investigation. 

Reliability 

The scales developed must measure reliably over time if they are to be 
put into use. Appropriate reliability studies would be done since it has 
been demonstrated that interests can be measured reliably for both men and 
women (Campbell, 1966; 1969) • I would anticipate no problems in this area* 

Norms 

I would provide general population norms for the homogeneous scales. 

At least high school, college age^ and adult norms would be needed. The 

procedure used in item selection should minimize sex differences on these 

scales but whether or not they would eliminate them is a question which the 

data must answer. My guess is that I would still find women scoring * --^^v 

somewhat higher on the social or some corresponding scale and meu scoring 

higher on mechanical or its corresponding scale. 

Whatever my findings > I would build a periodic normlng procedure into 

my organizational and fiscal structure. Too many test developers provide 

new norms only after it is clear that the old ones are outmoded or inadequate 

Realistically, this practice is understandable, but it contributes to poor 

testing practice. Ethically, it seems wiser to build the cost of up-to-date 

norms into the charge for the testt because it is an Important service Lc 

the user. 

For the occupational scales I would provide occupational norms. The 
evidence suggests less change over timewithin occupational groups than with-- 
in total population so I would be less concerned about renormlng these 
scales. 
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Validity 

The occupational scales would presumably have concurrint validity 
because they were developed using occupatioaal groups. Tiltoa's over- 
lap (1937) would be computed between each occupational group and the 
general reference group to establish the concurrent validity of the oc- 
cupational scales. Concurrent validity for the homogeneous scales, which 
\«ould hopefully be related to Holland's types, could be established by 
scoring the occupational samples on them. The high scores of ealch occu- 
pational group would be expected to correspond to those listed for the 
occupation in Holland's Occupations Finder (1970). 

A major concern would be whether the developed scales predict 
future occupational behavior. The only way to examine future behavior, 
assuming that the items selected for the scales had never been used 
together before, is to do a large scale longitudinal study. Ideally, 
the subjects originally studied would be of various ages, not Just 
college freshmen. On followup, the predictive power of both types of 
scales, homogeneous and occupational, would be assessed by age at 
first testing and by sex. The basic data might take the form suggested 
by Figure 2. Hit rates or proportions of correct predictions would 
be calculated. Two sets of decision rules would be necessary, one for 
determining how individuals would be placed in each predictive category, 
and one for determining what kind of vocational behavior would be 
declared a correct prediction. Individuals could be placed in predictive 
categories on the basis of their highest scores, on the basis of scoring 
high on « scale compared to the general population, or on the basis of 
scoring high on a scale compared to people In a relevant occupation. 
Actually, calculating hit rates using various sets of decision rules 
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for making predictions would also provide valuable information about 
the various interpretive strategies* 

Decision rules for assessing hits based on hotnogeneous scales 
might be based on predictions derived froui either Holland S Occupations 
Finder or Cole^s structural model as criteria^ although a failure to 
find a high proportion of hits in this case might be interpreted as a 
lack of validity in either the predictor scales > the criteria, or both. 
It appears to be such easier to assign a hit or correct prediction 
when occupational scales are used because it is easy to tell whether 
the individual is in the occupation named by the scale or not. However, 
hits have usually been attributed to occupational scales when an 
Individual entered the occupation named by the scale or a closely 
related occupation (Campbell, 1966 (b); Harmon, 1969) • 

One problem which must be addressed in assessing the predictive 
validity of interest inventories for women is that not all wom&n work* 
The nutiber and proportion of working women is Increasing (U.S. Department 
of Labor, 1969) but Kriger's recent research (1972) suggested that the 
basic career decision made by women is whether or not to work. Perhaps 
my interest inventory should have a scale to predict orientation toward 
career commitment or away from it for both women and men. Schlssel 
(1968) was able to build such a scale for women using the Strong 
Vocational Interest Blank for men. If a career orientation scale were 
available, non-working could be a predicted criterion. However, until 
the care of young children is shared equally by iralos and females in 
our society, there will probably be many young women with high career 
aspirations at home caring for children. 
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Harmon (1969) employed a strategy for assessing predictive validity 
which rettiDved non* career women from consideration and gave career 
woman who were temporarily out of the job market credit for their 
career commitment* In the proposed fol low-up > the same latitude 
could be extended in assigning men to criterion categories* 

A related problem is that many women do, unfortunately, enter 

occupations out of convenience or need* A large proportion of families 

with incosoes bejow the poverty level are headed by women (United States 

Department of Labor ^ 1969)* Few of them, left with families, can 

pay much attention to their interests and to long range planning. 

Circumstances like these also befall men but they seem oore common 

for women* If they are, in fact, more common for women, we would expect 

a longitudinal follow*up of the predictive accuracy of an interest 

inventory to show better prediction for men than women* Thus, if society 

offers a wider range of choices to one sex or the other, we might 

expect better prediction of job criteria from interests for that sex, 

unless of course, there is an interaction between the opportunities 

society offers and the development of interests. Both Roe (1956) and 

Holland (1973) suggest that there is such an interaction. To explore 

cultural change and its interaction with interests our follow-up studies, 

as well as norming efforts^ would have to be put on a schedule* Actually, 

they might go hand in hand, with new norm groups tested every five 

years and followed up 10, 15 and 20 years later* 
Interpretation 

Remembering that this fantasy is predicated on unlimited funds, 
I would do some unusual things to Insure that the Interest Inven- 
tory I had developed would be well used* I would substantially 
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discount the cost of roaterials and scoring to every qualified user who 
attended regular workshops on its use and/or passed periodic examinations 
which showed that they understood vocational interest technology in 
general* and new developsaents in particular. Goldoian (1972, a; 
1972, b) has so despaired of the counselor's conjpetency in using tests 
that he has advocate! that counselors stop using them. I would attempt 
to educate counselors, instead. 

Interpretative materials available for clients would stress the 
int'»ractlon between culture and testing. They would point out the 
necessity for renorming and the possibility that people in general 
might change over the next five years. For homogeneous and occupational 
scale%they would clearly present what validity data is now available 

T 
f 

and what is needed in the future to be more certain about the ^Aredictionfc? we 

make. They would give the client enough information about what la and 

is not known ^ to make a decision as to how nuch weight to place on the 

scores. This procedure would he difficult to implempnt but it puts 

the responsibility where it belongs » on the client. The counselor 

whose needs are met by knowing mor^ than the client and grandly unraveling 

the mystery before the clients* eyes, would not be very comfortable with 

either the ambiguity or the appeal to the client as counselor which 

these materials would contain. 

Epilogue to the Fantasy 

All this ambiguity makes me quite uncomfortable, too* None of the 
elements of my fantasy are new or terribly creative, but I am impatient 
because the answers are not yet available. We do not know the effect 
of balancing items in an interest Item pool according to sexual re- 
sponse item differences. We do not know the effect of developing 
occupational scales with criterion groups of men and women, equally 
weighted « We do not know whether Cole^s method of locating clients 
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In a structural iDodel has predictive validity. We do not know ouch 
about how interests develop and change* While the ainblguities isake 
Bie intellectually uncomfortable, I am even i&ore uncomfortable with my 
clients* 

None of the measurement procedures available to me provide me 
with both a lack of sexual bias and good predictive validity* Apparently » 
X will have to live out lay own fantasy and be absolutely honest with 
my clients about the sexual bias in items » scales » and norms in the 
interest inventories I ask them to complete, and about the uncertain 
validities associated with using *H«rong'' sex scales or with using 
ColeS structural model* 

When a fantasy is shared, what is the expected result? Approval, 
argument, or attention? This author will settle for action* I am 
not so concerned that we settle the questions I have raised in the 
direction I have predicted, but that we investigate whether our view 
of the world of work as dichotomized by aex is really necessat^ in 
interest measurement* This basic question i^ust b^ answered before we 
know how to define sex bias in interest measurement more clearly and 
eliminate it more surely* To that end the following guidelines are 
dedicated* 

Guidelines 

Zn summsry, the following guidelines can serve as both a means of 
aasessing current interest inventories for sex bias aa well as a plan 
for action for developing new interest inventories which are free of bias* 
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1, The content of Interest inventory itetna should not Imply that any 
occupation or activity is nore appropriate for one sex than the other, 

2, The pool of Interest Inventory items which make up an interest 
inventory should be appropriate for both sexes and used for both sexes. 

3, Because there are sex differences in item responses, the item pool 
should contain equal numbers of items which arp more highly endorsed 
by men than women and Items which are more highly endorsed by women 
than men. 

A. Groups used in developing scales should be composed of men and women 
In equal proportions or the effects of both groups should be 
statistically equalized. 

5, Scores on the sarof? scales should be "^available for both men and 
women (not Just scores on scales which have the same names but which 
use dlffcrc^it Itema). 

6, Norms on homogeneous interest scales should not be presented by 
age or sex without evidence about the predictive value of a 
"high" score as related to such norm groups, 

7, Norm groups for occupational scales should contain men and women 
In equal proportions or the effects of both groups should be 
statistically equalized. 

8, Publishers of interest inventories should have a preannounced 
plan of periodically renorm^ng homogeneous scales, 

5. Publishers of Interest inventories should have a preannounced plan 
for periodic studies of the predictive validities of both honogeneous 
and occupational interest scales. 
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10« Published interpretive materials should indicate clearly that 

sex norms for homogeneous scales do not imply anything about the 
predictive validity of the scales and encourage clients to evaluate 
their scores in relationship to both sexes. 

IK Published interpretive materials should indicate clearly that 
uaing occupational scales developed for one sex with the other 
sex is of questionable validity because the underlying technology 
tends to maximize sex differences unduly* 

12* Publishers of interest Inventories which do not meet th<*ae guidelines 
should show evidence that attempts are being made to define » study » 
and eliminate sources of sex bias in their instruments* 
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Table 1 

Iteot Appearing on Three Tentative SCIX Scale* 
For Veterinarians (Xtens 1-131, Occupations) 



Direction "Like" 


Scales on Which Items Aooear 


Items Is Scored 


F (237.) 


M (i8r.'> 


F&M (157.) 


2 Advertising 




X 


X 


Executive 




• 




10 Auctioneer + 




X 


X 


13 Author of technical 


X 




X 


books 








19 Biologist + 


X 


X 


X 


29 Church worker -f 


X 






30 City or State 


X 


X 


X 


Employee 








31 City Planner 




X 


X 


34 Computer Operator 




X 




41 Dentist + 




X 


X 


42 Designer, Elec- 




X 




tronic equipment 








43 Dietician 




X 




46 Editor 




X 




49 Elementary School 


X 




X 


Teacher 








50 Employment - 


X 


X 


X 


Manager 








52 Farmer + 


X 


X 


X 


59 High School 


X 


• 




Teacher 








60 Home Economics 


X 




X 


Teacher 








70 Labor Arbitrator 




X 




71 Laboratory + 


X 




X 


Technician 








78 Manager Child 


X 




X 


Care Center 








79 Manager Women's 


X 




X 


Style Shop 








85 Newspaper Reporter - 




X 


X 


86 Nurse + 




X 


X 


88 Office Clerk 


X 






89 Office Manager 






X 


92 Pharmacist + 




X 


X 


94 Physician + 


X 


X 


X 


95 Playground 


X 






Director 








98 Politician 




X 


X 



Itetna 



99 Private Secretary 
lOA ^bllc Relations 

Director 
105 Rancher 
107 Receptionist 
109 Sales Manager 

111 Scientific 
Illustrator 

112 Scientific Re- 
search Worker 

113 Social Worker 
121 Surgeon 

127 Vocational 
Councelor 

130 X-Ray Technician 

131 YH:a/YWCA Staff 
Menber 
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Figure 1 

Responses to the Item "Biologist" 



Like Z Indifferent % Dislikp % 

Veterinarians (Female) 90 9 1 

Women* in*General £^ 12 Zl 

Differences +44 - 20 - 24 

Scale Weights +1 - 1 - I 
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Figure 2 

Predictive Validity of Interest Scales 
Established by Hit Rates 



Predictions 
(by scale) 



Number of 
Predictions by 
Age Group 
At Testing 



N 



Social (homogeneous) 



High School 
College 
Adult 
Total 



Social (Occupational) 



High School 
College 
Adult 
Total 



Investigative (homogeneous) High School 

College 
Adult 
Total 



Scientific (Occupational) 



High School 
College 
Adult 
Total 



etc. 



ERIC 



Percentage of Hits 
Sex 



Hale 



Female 



Total 




